Data-driven extraction of relative reasoning rules to limit combinatorial explosion in biodegradation pathway prediction
نویسندگان
چکیده
MOTIVATION The University of Minnesota Pathway Prediction System (UM-PPS) is a rule-based expert system to predict plausible biodegradation pathways for organic compounds. However, iterative application of these rules to generate biodegradation pathways leads to combinatorial explosion. We use data from known biotransformation pathways to rationally determine biotransformation priorities (relative reasoning rules) to limit this explosion. RESULTS A total of 112 relative reasoning rules were identified and implemented. In one prediction step, i.e. as per one generation predicted, the use of relative reasoning decreases the predicted biotransformations by over 25% for 50 compounds used to generate the rules and by about 15% for an external validation set of 47 xenobiotics, including pesticides, biocides and pharmaceuticals. The percentage of correctly predicted, experimentally known products remains at 75% when relative reasoning is used. The set of relative reasoning rules identified, therefore, effectively reduces the number of predicted transformation products without compromising the quality of the predictions. AVAILABILITY The UM-PPS server is freely available on the web to all users at the time of submission of this manuscript and will be available following publication at http://umbbd.msi.umn.edu/predict/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
The University of Minnesota pathway prediction system: predicting metabolic logic
The University of Minnesota pathway prediction system (UM-PPS, http://umbbd.msi.umn.edu/predict/) recognizes functional groups in organic compounds that are potential targets of microbial catabolic reactions, and predicts transformations of these groups based on biotransformation rules. Rules are based on the University of Minnesota biocatalysis/biodegradation database (http://umbbd.msi.umn.edu...
متن کاملPredicting biodegradation products and pathways: a hybrid knowledge- and machine learning-based approach
MOTIVATION Current methods for the prediction of biodegradation products and pathways of organic environmental pollutants either do not take into account domain knowledge or do not provide probability estimates. In this article, we propose a hybrid knowledge- and machine learning-based approach to overcome these limitations in the context of the University of Minnesota Pathway Prediction System...
متن کاملCritical Reasoning
Model-based diagnosis algorithms face a combinatorial explosion. To combat this explosion, this paper presents a fundamentally new architecture, IMPLODE, which constructs an abstract representation of the environment, the conflict, and the diagnosis spaces using a sensitivity analysis of assumptions. Experimental results show that the most dramatic improvement is obtained for circuits which are...
متن کاملRule-Based Modelling of Cellular Signalling
Modelling is becoming a necessity in studying biological signalling pathways, because the combinatorial complexity of such systems rapidly overwhelms intuitive and qualitative forms of reasoning. Yet, this same combinatorial explosion makes the traditional modelling paradigm based on systems of differential equations impractical. In contrast, agentbased or concurrent languages, such as κ [1,2,3...
متن کاملApplication of the rule extraction method to evaluate seismicity of Iran
Assessing seismic hazards involves specifying the likelihood, magnitude and location of earthquakes in a region. Predicting the seismic hazards is the first step in reducing the impact of the damage caused by an earthquake. In this study, to fully utilize all the known parameters which may possibly affect the occurrence of earthquakes (mb ≥ 4.5); a data-driven rule-extraction method called the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 24 18 شماره
صفحات -
تاریخ انتشار 2008